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1 Introduction 

A compact (not necessarily connected) Lie group G carries a (unique) bi- 
invariant probability measure. Using this measure, one can average orbits of 
actions of G on afhne convex sets to obtain hxed points. In particular, if G 
acts on a manifold M, G leaves invariant a riemannian metric on M, and this 
metric can sometimes be used to obtain fixed points for the nonlinear action 
of G on M itself. By this method, Elie Cartan proved that a compact group 
G acting by isometrics on a simply-connected manifold M of nonpositive 
sectional curvature always has a fixed point.Q Cartan's result was extended 
by Grove and Karcher [0] to arbitrary manifolds under the assumption that 
the G-action has an orbit which is sufficiently small relative to a distance 
scale provided by the geometry of M. 

In this paper, we develop a method for averaging nearby submanifolds 
in a riemannian manifold. This enables us to extend the Grove-Karcher 
theorem from points to submanifolds; i.e. we establish that, if a riemannian 



*Research partially supported by NSF Grants DMS-96-25122 and DMS-99-71505 and 
the Miller Institute for Basic Research in Science. 

^In Note III, n° 19 of Cartan proved the fixed point theorem for a finite group; he 
then remarked on p. 19 of that the same argument works for a compact group. See 
Theorem 13.5 in Chapter 1 of Helgason's book [jl6[ for a full proof in the compact case. 
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G-manifold has a compact submanifold N whose images under the G-action 
are sufficiently C^-close to one another, relative to the geometry of the pair 
{M,N), then there is a G-invariant submanifold near N. Extending the 
Cartan-Grove-Karcher nonlinear averaging method to submanifolds requires 
new estimates on the geometry of tubular neighborhoods. Our construction 
is based in an essential way on Whitney's idea|^ pi[ | of realizing submanifolds 
as zeros of sections of vector bundles. 

Our result fits nicely into the "stability" framework promoted by Ulam 
(see for example page 584 of [^], as well as Anderson's interesting discussion 
using nonstandard analysis in [^). Ulam asks when a mathematical object 
which "almost solves" a certain problem can be shown to be near an exact 
solution. In fact, our interest in the invariant submanifold problem was 
motivated by some cases of the following "almost-homomorphism problem" : 
given a map <j) : G ^ H between groups such that <l){gh) is close in some 
sense to (j){g)(j){h) for all g and h in G, can one conclude that there is a 
homomorphism near 0? This problem was posed by Ulam and solved by 
Hyers [^] for G = = M. It was solved for compact Lie groups G and H, 
with effective estimates, by Grove, Karcher, and Ruh |15] using the Grove- 
Karcher fixed-point theorem. When G is compact and H is the unitary 
group of a Hilbert space, it was solved by de la Harpe and Karoubi 

Our invariant submanifold theorem implies an almost-homomorphism 
theorem when G is finite and H is the group of diffeomorphisms of a compact 
manifold. Our ultimate aim is to extend the almost-homomorphism theorem 
to cases where G is a compact group and H is the group of smooth bisections 
of a Lie groupoid. (See Q.) Such an extension would imply very strong 
structure theorems for certain groupoids. We refer to the lecture fs^ for an 
overview of these potential applications and their connection with symplectic 
geometry. Details of these applications will appear in subsequent papers. 

An important step in our proof requires averaging in a Grassmann mani- 
fold. This can be accomplished by Cartan-Grove-Karcher averaging, but we 
get estimates better suited to our purposes by identifying subspaces with 
projections and averaging the projections. The averaging of projections has 
already been treated, in finite and infinite dimensions, by a number of au- 
thors over several decades, including de la Harpe and Karoubi in the paper 

^See Chapter V of the cited article. After finishing this paper, I looked again at this 
reference and discovered that Whitney used a center of mass construction, too! 
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cited above. We review some of this work and derive the estimates we need 
in an Appendix. 

The body of the paper is organized as follows. In Section 2, we define a 
C^-distance between submanifolds of a riemannian manifold and, in terms 
of this distance, state our main theorem on invariant manifolds. Theorem 
2.2, which follows immediately from the averaging theorem. Theorem 2.3. 
Section 3 develops estimates on geometric objects in tubular neighborhoods; 
the reader may wish to skim this section and return to it after reading 
Section 4, where we first outline the proof of the averaging theorem and 
then fill in the details. In Section 5, we discuss some alternate approaches 
to the averaging of submanifolds which may be interesting in their own 
right, though we have not yet succeeded in implementing any of them. We 
conclude in Section 6 with a discussion possible extensions of our results, 
and the relation of our work to other areas. 

In view of potential applications to almost actions of compact Lie groups, 
we note that most of our work extends without modification to the case 
where M is a Hilbert manifold, though the submanifold must remain 
finite-dimensional. There is just one issue, the application of degree theory, 
which now restricts our main result to finite-dimensional M. 

We have not tried to find optimal values for the many constants in the 
estimates in Sections ^ and ^. Rather than denoting these constants by 
letters, though, we have chosen numerical values which are valid and which 
may be recognized when they reappear. The resulting constants in the main 
theorems are therefore very far from optimal. The reader is invited to find 
better ones. 

A word on notation: we will write •) to denote all of the many dis- 
tance functions used in this paper, decorating it with subscripts only when 
necessary to avoid ambiguity. We will use bars | • | for lengths of paths and 
vectors, and double bars || • || for operator norms. 

Many people have offered useful suggestions in the course of this work. I 
would particularly like to thank David Aldous, Eugenio Calabi, Marc Cham- 
berland, Jost Eschenburg, Robert Greene, Mikhail Gromov, Karsten Grove, 
Hermann Karcher, Yi Ma, Yoshiaki Maeda, and Jamie Sethian. 
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2 Bounded geometry and distance between sub- 
manifolds 



Let be a closed submanifold of the riemannian manifold M. The normal 
bundle uN C TM consists of those tangent vectors along N which are 
orthogonal to TN. The open ball bundle of radius r in vN will be denoted 
by u'-N. 

We define the normal exponential map cxpjy to be the restriction to 

vN of the exponential map exp : TM M. (If M is not complete, exp^, 
like exp, is defined only on a proper open neighborhood of the zero section.) 
The normal injectivity radius ijv of is 

sup{r| expjv (is defined and) is an embedding on u'^N}. 

(When iV is a point, its normal injectivity radius is just the usual injectivity 
radius of M at that point.) The restriction of exp^^ to i/'^iV is still an 
embedding; we call its image TN the tubular neighborhood of N. For any 
r G (0, ijv], the set 

T^'N = {xe M\d{x, M)<r} = N) 

will be called the r-tube about N . 

The bundle retraction in uN is transferred by exp^ to a retraction ttat : 
TN — > N which maps each point of TN to the nearest point in N . The 
fibres of Tr^v will be called the normal slices for N. 

2.1 The Grassmann bundle 

We will denote by GM the Grassmann bundle over M, whose fibre over each 

point X is the (disconnected) manifold of all finite-codimensional subspaces 
of the tangent space TxM. We give each fibre the Finsler metric described 
in the Appendix. 

These fibre metrics define a supremum distance on the space of sections 
of GM over any subset of M. 

The Gauss map of a submanifold N is defined as the section of GM 
over A^ whose value at each point is the normal space to A'^. This section 
extends in a natural way to a section F^v : TN GM defined over the 
tubular neighborhood of A^: we take the normal spaces to A^ and parallel 
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translate them along all the geodesies normal to N. The image of r^r, 
when thought of as a vector bundle over the tubular neighborhood, will be 
called the vertical bundle and its orthogonal complement (obtained by 
translating the tangent spaces of N) the horizontal bundle. 

The vertical bundle contains the tangent vectors to the normal geodesies 
but does not in general coincide with the tangent bundle along the normal 
slices. We will refer to the latter as the quasi-vertical bundle. (The two 
bundles are compared in Section |3.3| .) The orthogonal complement to the 
quasi-vertical bundle will be called the quasi-horizontal bundle. 

2.2 Bounded geometry and distance 

To define a geometrically meaningful distance between submanifolds, we 
should measure distances and angles in the same units. Since angles are 
dimensionless, we do this by choosing a positive number c which functions 
as a "unit of inverse length." Given c, we say that a pair (M, A^) consisting 
of a manifold M and its submanifold N has geometry bounded by c if: 
(i) IN > 1/c; (ii) the sectional curvatures of M in the 1/c-tube about are 
bounded in absolute value by c^; (iii) the injectivity radius of each point in 
the tube is at least 1/c. If N is compact, {M,N) necessarily^ has geometry 
bounded by some c. 

By rescaling the metric, we can always convert geometry bounded by c to 
geometry bounded by 1. (Note that rescaling the metric does not change the 
metric on the Grassmann manifold.) We will therefore restrict the statement 
of our results to this case, leaving it to the reader who so desires to put the 
c's back in. A pair (M, A^) with geometry bounded by 1 will be called a 
gentle pair. 

Remark 2.1 Condition (iii) in the definition of bounded geometry is used 
in only a couple of places below. Perhaps our results can be obtained without 
this condition. One example where the condition is violated is a geodesic 
circle A^ in a very thin cylinder M. 

If A^ and A^' are submanifolds of M such that A^' lies in TN and is the 
image under exp^ of a section a{N, N') of i^N, we define their (nonsym- 
metric! but see Remark |1|) C^-distance d{N, N') as follows. We assign 



Even if Af has infinite dimension! 
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two numbers to each to each x' £ N': the length of the geodesic segment 
T from x' to the nearest point x on and the distance between T^'N' and 
the parallel translate of TxN along r. The C-'^-distance d{N, N') is defined 
as the supremum of all these numbers as x' ranges over A^'. Since the dis- 
tance between two subspaces equals the distance between their orthogonal 
complements (Corollary [A.6| ), we can use normal spaces instead of tangent 
spaces and interpret d{N, N') as the maximum of the supremum norm of 
a{N,N') and the distance between the extended Gauss map Tjv and the 
Gauss map T^/ as sections of GM over N' . 

We will occasionally use a distance as well, defined simply of the 
maximum over A^' of the distance to the nearest point of N. This distance 
is defined for any pair of compact subsets of a metric space. 

We can now state the main result of this paper. 

Theorem 2.2 Let M be a riemannian G -manifold for a compact group G, 
and let (M, A^) be a gentle pair. If d{N, gN) < e < 20S00 f'^^ 9 ^ G, then 
there is a G-invariant submanifold N with d{N, N) < 

This theorem follows immediately from a more general statement, anal- 
ogous to Grove and Karcher's center of mass construction [14| for points. 



Theorem 2.3 Let M be a riemannian manifold and {Ng} a family of com- 
pact submanifolds of M parametrized in a measurable wa'^ by elements 
g of a probability space G, such that all the pairs (M, Ng) are gentle. If 
d{Ng,Nfi) < e < 20000 for all g and h in G, there is a well defined center 
of mass submanifold N with d{Ng,N) < 136^/€ for all g in G. The cen- 
ter of mass construction is equivariant with respect to isometrics of M and 
measure preserving automorphisms of G. 



Remark 2.4 When G consists of two points with equal masses, we obtain 
a natural midpoint construction for pairs of nearby submanifolds in a rie- 
mannian manifold. We know of no simpler construction, even for pairs of 
embedded circles in euclidean space. 

^ We will not attempt to give a precise definition of this notion, which should involve 
measurable dependence of derivatives on parameters to permit the exchange of integration 
and differentiation. In the applications we have in mind, the parameter space is a manifold 
and the dependence is smooth. 
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The numbers 20000 and 136 in the theorems above can certainly be 
reduced if more care is taken in the many estimates used in the proof. On 
the other hand, it is not at clear whether the factor -^e can be replaced by 
a multiple of e itself. Our proof does show that the distance from A'^ (or 
Ng) to N is bounded by lOOe, but we do not see how to get an estimate 
linear in e for the derivative. (See Remark 3.14| .) 



3 Geometry in tubular neighborhoods 

In this section, we will establish some geometric estimates needed for proving 
the main theorem. 

3.1 Estimate on the second fundamental form 

Recall that the second fundamental form B oi N is defined for a normal 
vector V and tangent vectors Wi at p £ N hy 

By{wi,W2) = -{v,'Dw^W2), 

where W2 is a vector field tangent to N with W2{p) = D is the rieman- 
nian connection, and ( , ) is the riemannian inner product .0 The norm \\B\\ 
is then defined as the supremum of |i?^(i(;, it;)| as v and w range over unit 
normal and tangent vectors respectively, and their basepoint ranges over N. 

Since focal points along geodesies normal to a submanifold cannot occur 
within the normal injectivity radius, gentleness of (M, N) implies that all 
focal points to occur at a distance greater than 1 from N. This fact and 
Proposition below will imply a bound for the second fundamental form 
of N (Corollary ^ 



Proposition 3.1 Let v and w be unit normal and tangent vectors respec- 
tively at p £ N . Suppose that By{w,w) = —b for some b > and that the 

^We follow the sign convention used by Warner (following [^) and Eschenburg 
|ic[| . This convention is consistent with the interpretation of the second fundamental form 
as the "first variation of the metric," but most authors use the opposite sign, consistent 
with the direction of principal curvature vectors of curves in a submanifold. Lang 
introduces second fundamental forms with both signs as the off-diagonal entries in a single 
2x2 block matrix. 
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sectional curvatures of M are bounded above by —1. Then the geodesic seg- 
ment 7 with initial velocity v and length L contains at least one focal point 
to N as long asb>l/L + L/2. 

Proof. We use a standard argument to show that the index form of 7 (with 
appropriate boundary conditions) is not positive definite. 

Using parahel translation and a slight abuse of notation, we represent 
any vector field X along 7 by a function X : [0, L] TpM. The index form 
on the space of such X with X[t) £ TpN and X(L) = is defined by the 
integral: 

I{X,X) = B,{XiO),X{0)) + r {{X'{t),X'{t)) - {Rit)X{t),Xit))) dt, 

Jo 

where R{t) is the operator given via the Riemann curvature tensor as 

For the "linearly decreasing" vector field X(t) = (1 — t/L)w, 

I{X,X) = B^{w,w)+ [ {{l/L^)-{l-t/L){R{t)w,w))dt 

Jo 

= -h+l/L- I {1 -t/L){R{t)w,w)dt. 
Jo 

Since the sectional curvatures are at least —1, 

I{X, X) < -h+l/L + f (1 - t/L)dt = -h+l/L + L/2. 

Jo 

When h > l/L + L/2, I(X,X) < 0, so there must be at least one focal point 
on 7. 

□ 

Corollary 3.2 If(M,N) is a gentle pair, then the second fundamental form 
of N satisfies the bound \\B\\ < |. 

Proof. The distance from to any focal point is at least 1. If \By{w, w)\ > ^ 
for some unit vectors v and w, we can change the sign of v if necessary to 
assure that By{w,w) < — |, and Proposition 3.1 leads to a contradiction. 

□ 
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3.2 Comparison of curves on and near 

The following lemma shows that the retraction vrjv : TN N does not 
increase lengths too much. 

Lemma 3.3 Let (M, A^) be a gentle pair. Let a he a path in the r-tube about 
N for some r < ^. Then 

(cosr — I sinr)|7rAr o (t| < |o"|. (1) 

(The function in parentheses decreases with r and takes the values 1 atr = 0, 
.59 . . . at r = ^, and .15 . . . at r = ^.) 

Proof. Assume that a is defined on the interval [0, 1] and construct a "rect- 
angle" S : [0, 1] X [0, 1] ^ M as follows: for each s G [0, 1], s ^ T.{s, 1) is 
the minimizing geodesic from 7rj\f{a{s)) to ct(s), parametrized proportionally 
to arc length. Corresponding tangent vectors to a and ttn o a are thus the 
terminal and initial values of A^-Jacobi fields along normal geodesies. By 
Warner's generalization of the Rauch comparison theorem (|2^, Theorem 
4.3(b)), using the upper bound 1 for the sectional curvature and the bound 
-Bo-'(o) ^ (Corollary p. 21 ) on the second fundamental form of A", we 

obtain the inequality |o"'(s)| > (cosr — |sinr)|(7rAr o a)'{s)\ for all s, which 
gives (I). 

□ 

Corollary 3.4 Let {M,N) be a gentle pair, x and y points of the j-tube 
about N such that d{x,y) < \. Then 

d{'KN{x),iTN{y)) < 7d{x,y). 

Proof. A path of length less than ^ from x to y remains in the ^-tube about 
A^. Lemma gives the desired inequality since 1/.15 < 7, 

□ 

A similar argument gives: 

Corollary 3.5 For a gentle pair {M,N), let x and y be points of N such 
that dM{x,y) = a < |. Then 

duix.y) < dN{x,y) < . , dM{x,y) < 7dM{x,y). 

cos 2^ — f sm 
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Remark 3.6 This estimate and the ones to foUow can surely be sharpened, 
but the versions which we give suffice for our purposes. 



3.3 Vertical and quasi-vertical bundles 

The quasi-vertical bundle, tangent to the normal slices for N C M, is gener- 
ally not parallel along normal geodesies and thus does not coincide with the 
vertical bundle. We will estimate the distance between the two bundles by 
estimating the covariant derivative of the quasi- vertical bundle along normal 
geodesies. 

Proposition 3.7 For x G TN , the distance between the quasi-vertical and 
vertical spaces is at most ^d{x,N)'^. The distance between the projections 
on this spaces is at most ^d{x,N)'^. 

Proof. Tangent vectors to normal slices are the values of those Jacobi fields 
along normal geodesies vanishing on and with derivative there in vN. Let 
X{t) be such a field along the geodesic 7(t), parametrized by arc length. 
The initial derivative ^'(0) belongs to v^^^-^N; for convenience we take it 



to be a unit vector. As in the proof of Proposition 3.1, we identify all the 
tangent spaces along 7 with T^(^q^M by parallel translation. 

Since X(0) = 0, we write X{t) = tX'{0) + Z{t). Then Z(0) = Z'{0) = 0, 
and Z"{t) = X"{t) = —R{t)X{t) by the Jacobi equation. The curvature 
estimate < 1 gives first of all the estimate ||X(t)|| < sinht, so 

||Z"(t)|| < sinht. Integration gives \\Z'{t)\\ < cosht - 1 and then \\Z{t)\\ < 
sinht - t < t^/6. Then \\X{t)/t - X'{0)\\ < t^/6 and \\X{t)/t\\ > 1 - t^/6. 

The typical quasi- vertical unit vector {X(t)/t)/\\X{t)/t\\ is thus within 
distance 

tye ^ t^ 

1 - ^2/6 ~ 5 

of the vertical vector X'(0)/||X(t)/t||, from which it follows (for t <\) that 
the distance between the vertical and quasi-vertical spaces at x = 7(t) is at 
most 

sin-^(tV5) < = d{x,Nf/A, 

while the difference between the projections on these spaces is at most 
\d{x,N)\ 

□ 
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An estimate for the covariant derivative of the quasi-vertical bundle in 
non- vertical directions would simplify our later considerations, but we have 
not been able to find such an estimate]^ Instead, we will estimate the 



covariant derivatives of the vertical bundle and then use Proposition 3.7 to 
shuttle between the two bundles. 

Proposition 3.8 Let {M,N) be a gentle pair, Pjv its extended Gauss map, 
and Pr^ the field of orthogonal projections onto the image ofT]\j ( the vertical 
bundle). Then the covariant derivative DPr^^, a field of maps from TM to 
End TM , has operator norm bounded pointwise by 11 on the tube T^N. 

Proof. We estimate DPr^r along N in terms of the second fundamental 
form of A^, then we extend the estimate into the tubular neighborhood by 
differentiating along the normal geodesies. 
Along AT, D 

i,Prjv is zero if v is vertical, by the definition of r^v- If is 
horizontal, D^Pr^ is a symmetric operator which exchanges horizontal and 
vertical spaces. For vertical v and horizontal /i, it follows easily from the 
definition of the second fundamental form that ((D^Pr^)(w), h) = By{w,h). 
Along A^, then, ||DPr^|| = which is at most | by Corollary |3.2| . 

To estimate DPr^^ at a point x £ TN, we let 7 be the unit speed geodesic 
segment from 7r7v(x) to x and estimate D-y/(s)DPrjv. Let y be a unit length 
vector field on a neighborhood of 7 such that 7 is an integral curve of Y. 
For any unit vector v £ T^M, we can find a vector field Z near 7 such that 
[y, Z] = and Z{x) = v. 

Now DyD^Pr^ = D^DyPr^ + [R{Y,Z),¥r^]. But DyPr^ = 0, 
||y|| = 1, and ||Pr„|| = 1, so ||DYDzPr«|| < 2||i?|| \\Z\\, and hence 



||D,Pr^|| < | + ci(x,A^)-2||i?|| sup \\Z{-i{s))\\. 

0<s<d{x,N) 

Since the sectional curvatures are bounded in absolute value by 1, ||-R|| < 
I (see §6.1 in ^]), so it remains only to estimate the size of Z along 7. For 
this purpose, we construct a particular choice of Z by forming the rectangle 



(as used in Lemma 3^) consisting of minimizing geodesies from A^ to the 



points on the geodesic with initial vector Z{x) = v and letting Z be the 

®The difficulty seems to be related to the fact that a bound on curvature does not 
imply a bound on the derivatives of Christoffel symbols in normal coordinates; see the 
example on p. 34 of [h8|. 
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variational vector field. Z is an A^-Jacobi field along 7, so we can estimate 
it using Warner's comparison theorem |2^]. As in Lemma |3.3| , we conclude 
from Warner's theorem that ||Z(7(s))|| > (coss — | sins)||Z(7(0))|| for all 
s. Setting s = d(x,iV), we get: ||Z(7(0))|| < (cos (i(2;, A^) - | sind(x, A^))~^ 
which is less than 7 if < ^. 

Now the maximum of ||Z(7(s))|| might occur at an interior point 7(so) of 
the segment. If that happens, we must have T)ji(^so)^ -L -^(*o)- In that case, 
we can use Warner's theorem again, comparing with the situation where 
the initial manifold is totally geodesic, to get an estimate even better than 
before. 

Our conclusion is that supo<s<(i(2,^7v) ll-^(7("5))ll ^ 7, so that ||DPrjvll <J 
3 + i.2.|.7<ll. 

□ 

3.4 The potential function of a submanifold 

On the tubular neighborhood TN, the distance function from N, pn{x) = 
d{x,TrN{x)) = II exp]^^(x)||, is singular along N, but P/v = is smooth 
and has as a nondegenerate critical manifold (of absolute minima). We 
call P/v the potential function of N, thinking of N as the attracting 
manifold of a "linear" attracting force field. The points of N can also be 
recovered from P/v as the nondegenerate minimum points of its restrictions 
to the normal slices, to each normal slice, or equivalently as the zeros of 
the projection of the gradient VP/v into the quasi-vertical bundle. This 
description of will be most useful to us, since the projected gradient 
is transverse to the zero section in the quasi-vertical bundle. We will be 
proving Theorem |2.3| by identifying the invariant submanifold A^ as the zero 
set of an invariant object obtained by averaging VP/v over G and projecting 
into an averaged vertical bundle. 

We will need estimates on the hessian = DVP/v, a field of sym- 
metric bilinear forms on the tangent spaces of TN defined by Hfyf{X,Y) = 
(DxVP/v,y). Estimates for this hessian when A^ is a point are given in |^] 
(Proposition 6.4.6) and play an important role in applications of the center 
of mass construction. When dimA'^ > 1, the estimates are more compli- 
cated, since the hessian behaves quite differently in horizontal and vertical 
directions. 
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To calculate the hessian of Pjy, we adapt the method of Gray |13], who 
used a Riccati equation to study the second fundamental forms of the bound- 
aries of the r-tubes, i.e. of the level hypersurfaces of P/v. His analysis used 
the gradient Un of the function yOjv, which has the same level hypersurfaces. 
Since Un is the unit outward normal to these hypersurfaces, the hessian 
Sn = DC/at gives the second fundamental form of the tube boundaries. The 
covariant derivative of the tensor field Sn in the radial direction satisfies the 
Riccati equation 

Df/jv'S'Af = S% + Rn, 

where Rn is the field of operators obtained (as one does for the Jacobi 
equation) by contracting the radial vector field twice with the Riemann 
curvature tensor R of M; i.e. Rn{X) = R{Un,X)Un- The Riccati equation 
determines Sm in the tubular neighborhood once we know its behavior along 
N. But Sat, like Um, is not even defined along A^, so this initial behavior 
must be described asymptotically. This point is dealt with only briefly by 
Gray, so we paraphrase here the following result of Eschenburg ([0], §6.1). 

Proposition 3.9 Along a unit speed geodesic 7 emanating normally from 
N , the solution Sn of the Riccati equation has the asymptotic expansion 
S'7v(7(i)) = t^^S-i + So-\- 0{t), where S^i and Sq are in block matrix form 
with respect to the decomposition into horizontal and vertical subspaces as 
follows: 



S-i 



' I 


" 









So 












i?y(0) 



Unfortunately, the proof of this Proposition seems to belong to folklore 
rather than literature. Nevertheless, we will not write a proof here, either. 
(The proof is a nice exercise in the use of Fermi coordinates or Fermi frames 

HO 

We return now to the hessian = DVPat. Differentiating the relation 
VPat = pn'^Pn yields 

H]s = Un ®Un + PnSn, 

where Un'^Un is the operator of orthogonal projection onto the unit vector 
field Un- As the hessian of a smooth function, is defined throughout the 
tubular neighborhood, and its values along N are quite easy to compute. 
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In fact, since Pn is critical at all points of A^, we can compute the hessian 
at any such point as the second derivative matrix in any coordinate system 
for which the coordinate vector fields form an orthonormal basis at that 
point. From this observation, it is easy to check (see below) that H]\f is 
block diagonal with respect to the normal-tangent splitting of TM along 
N. On the tangent bundle TN, is zero, while it equals the riemannian 
metric on the normal bundle vN. 

Since it is not but rather the singular field 

Sn = {1/pn){Hn -Un® Un) 

which satisfies the Riccati equation, the initial conditions required for deter- 
mining or estimating a solution also include the normal (covariant) derivative 
of H]\f along A^. It is here that the second fundamental form of N appears, 



as we saw in Proposition 3.9 and one may easily check in the simple example 



where is a circle in the euclidean plane. 



3.5 Estimates for the hessian 



We will use the following estimate of Eschenburg and Heintze ||ll|] on solu- 
tions of Riccati equations to estimate the hessian of the potential function. 



Theorem 3.10 (Eschenburg-Heintze ||TT|]) Let Si{t) and S2{t) be 
maximal solutions of the Riccati equations 

S j — S j ~\~ j 

defined for < t < tj with values in symmetric matrices such that: 

(1) U := S2 — Si has a continuous extension to t = with U{0) > 0; 

(2) Rj{t) is a smooth function with R2{t) > Ri{t) for all t > 0. 
Then ti < t2, and Si < S2 on (0,ti). 

We recall that an inequality of the form P < Q between symmetric matrices 
means that the difference Q — P is positive semidefinite. It implies that 
the maximal eigenvalue of P is less than that of Q. As a consequence, the 
inequality —cl < P < cl for a positive number c implies the bound ||P|| < c 
on the operator norm of P. 
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We will use Theorem 3.1C to estimate the hessian of the potential func- 
tion for a gentle pair, by comparison with manifolds of constant curvature 
±1. Our first step will be to examine and estimate the solutions of the 
Riccati equation when R{t) is a constant multiple of the identity matrix. 

It suffices to consider the scalar case, i.e. the equations 

s' = s^±l 

for a real- valued function s{t) defined for t > 0. We must consider two kinds 
of boundary conditions, corresponding to the vertical and horizontal cases 
in the geometric situation. 

For the vertical case (directions normal to and their parallel translates 
along normal geodesies), we choose the solution with a pole at 0, namely 
— cot t when the sign of the curvature ±1 is positive and — coth t when it is 
negative. For convenience, we will sometimes use the notation cot+ for cot 
and cot_ for coth, as well as cot± to denote whichever is appropriate; thus 
the general solution can be written as 

Sver{t) = COt± t. 

For the horizontal case (directions tangent to N and their parallel trans- 
lates along normal geodesies), we take the solution with initial value b (which 
can be positive or negative), which is 

, , tan± t + b 

Shor[t) 



1 =F fetan-t t 



We will now estimate the function h{t) = ts{t) on the interval [0,1], 
since it is this product rather than s{t) alone which gives the hessian of the 
potential function. 

For the vertical direction, we have /iver(0 = t cot± t (notice that the pole 
has disappeared, as it should). With positive curvature, h{t) decreases from 
1 at t = to cot 1 ~ .642 ... at t = 1. With negative curvature, h{t) increases 
from 1 at t = to coth 1 ~ 1.313 ... at t = 1. Thus, we have the estimate 

.64 < /ivcr(i) < 1-32 for < t < 1. 

Notice that this estimate also applies to the hessian in the radial direction, 
which is identically 1, even though the relation between the hessian and the 
Riccati solution is more complicated in that case. 
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For the horizontal direction, we wih need only an upper bound, and there 
is no pole to worry about, so we can estimate Shor first and multiply by t 
afterward. For positive curvature. 



•Shor(i) 



tan t + b 
1 — fttant 



is monotonically increasing in b, so we can set 6 = |, by our estimate on the 
second fundamental form under the assumption of bounded geometry. This 
gives us 



1 - I tant 



I tan t < ^, i.e. 



To keep the denominator under control, we will require that 
tant < |, which is satisfied when t < .321 .... To keep the numbers simple, 
we will require that t < j, which leads to the upper bound of which for 
simplicity we replace by 3. This gives us the estimate 

\hhor{t)\ < 3t for 0<t<\. 

We are left with the negative case. But since tanh grows more slowly 
than tan, the estimate from the positive case still applies. 



Summarizing the estimates above and applying Theorem |3.10| , with the 



initial conditions given by Proposition 3.9 gives the following: 



Proposition 3.11 Let {M,N) be a gentle pair. At a point of M with dis- 
tance r < i from N, the hessian Hj^ when expressed in block matrix form 
with respect to the decomposition into vertical and horizontal parts satisfies 
the inequality: 



' MI 








-3r/ 



' 1.321 








3r/ 



(2) 



Corollary 3.12 On the \-tube around N , \\Hiq\\ < 1.32. 

Proof. The statement follows from the bound —.75/ < H]\f < 1.32/. 



□ 
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Corollary 3.13 At a point of Ad with distance r < ^ from N, for a hori- 
zontal vector V and a vertical vector w, \Hn{v,w)\ < 3^/r \\v\\ \\w\\. 

Proof. We may assume that v and w are unit vectors. By Proposition 



3.11 , \H]\[{v,v)\ < 3r and .64 < Hn{w,w) < 1.32. In addition, for any real 
number t, Hiy{tv + w,tv + w) < 3rt^ + 1.32. Thus the quadratic function 
t t'^{Hiy{v,v) — 3r) + 2tHiy{v,w) + Hiy{w,w) — 1.32 is negative for all 
t and hence its discriminant is negative. So 4Hj\^{v,w)'^ < 4{H]\f{v,v) — 
3r){HN{w,w) - 1.32) < 4 • 6r • 0.68. Therefore, H%{v,w) < 4.08r, and so 
\Hn{v,w)\ < 3^/r. 

□ 



Remark 3.14 The square root in Corollary |3.13 is responsible for the pres 



ence of ^/€ rather than e in the estimates in the main theorems. It seems 
plausible that the square root could be removed here, since the off-diagonal 
block of H]^ is zero along N, and so it must grow linearly with r on any 
particular manifold. Because of the singular nature of the relation between 
Sjy and H]\f, though, we have not been able to get a linear bound which is 
uniform for all gentle pairs. 

3.6 Geometry relative to nearby submanifolds 

In this section, we estimate the difference between geometric objects in the 
tubular neighborhoods of two nearby submanifolds. To begin, we show that 
the retractions to nearby submanifolds are close to one another. 

Lemma 3.15 Let (M, N) and (M, N') be gentle pairs such that d{N, N') < 
\, and letx e M with d{x,N) < \. Then (i(7rAr(x), 7rAr7r^(x)) < Ud{N,N') 
and (i(7rAr(x),7r^(x)) < l5d{N,N'). 

Proof. Let w = TT]\f{x), y = Tr]\f/{x), and z = iTN{y)- Let be the unit 
tangent vector at y to the geodesic segment yx, let r] be the parallel translate 
of ^ to z along the geodesic segment yw, and let C be the nearest unit vector 
to r] in the normal space I'zN. Let r and s be the endpoints of the geodesic 
segments zr and zs with initial directions r] and C respectively, each with 
the same length as yx. We will show below that d{x, r) and d{r, s) are small. 
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The resulting estimate on d{x,s), combined with Lemma 3.3, will give us 
an estimate on d{w, z), which will lead to the result. 

To estimate d{x, r), we observe that x and r are the endpoints of geodesic 
segments departing from y and z with initial tangent vectors related by 
parallel translation along the geodesic yz. It follows that d{x,r) < Xd{y,z), 
where A is a bound on the growth of a Jacobi field with initial derivative 
along a geodesic segment of length d{x,y), under the assumption that 
sectional curvatures are > — 1. 

We write a for d{N,N') and /3 for d{x,N). Since d{x,y) < d{x,N) + 
d{N,N') = /? + a, we can take A = cosh(/? + a), and since d{y,z) < 
d{N, N') = a we have 

d{x, r) < a cosh(/3 + a) . 

For d{r,s), we use the fact that r and s are the endpoints of geodesic 
segments with the same initial point, length at most a + /?, and initial 
(unit) tangent vectors making an angle of at most a with one another (since 
d{T]\f{z),Tj\i'{z)) < d{N,N') = a). The usual Jacobi field estimate, using 
again the lower bound of —1 on curvature, gives 

d{r, s) < a sinh(/? + a) , 

and hence 

d{x, s) < a(cosh(/3 + a) + sinh(/3 + a)) = ae^^" < aes < 2a. 

By Corollary |3.4| , d{w, z) < 14a. Since d{y, z) < a, the triangle inequal- 
ity gives the required estimate for (i(7rAr(x), 7r^(x)). 

□ 

Next we estimate the difference between the tangent vectors from a point x 
to the submanifolds and A^'. 

Lemma 3.16 Let {M,N) and {M,N') be gentle pairs such that d{N,N') < 
and let x € TN be such that d{x,N) < |. Then the difference between 
the initial tangent vectors S^^ix) and ^^/{x) of the minimizing geodesies from 
X to 7Tn{x) and ^^'{x) respectively is less than ^ d{N,N'). 
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Proof. As in Lemma |3.15 , we let a = d{N,N'). According to that lemma, 



we can connect tt^^x) to 7rN'{x) by a path a of length less than 15a < 
Since the distance from x to is less than ^, all the points of a are within 
a distance 1 of x, so that we can lift a by the exponential map at x to a 
path a in the tangent space TxM and thereby produce a smooth family {7^} 
of geodesic segments joining x to the points of a. Note that a{0) and a{l) 
equal ^iv(x) and S,N'{x) respectively. 

We estimate the length ratio by using the Jacobi fields along the 

segments 7^. Since the sectional curvature in M is bounded above by 1, the 
segments have length less than 1, and sinx/x > | for < 2; < 1, we have 
the estimate 

H/l^l>i- 

I • 15a = is therefore an upper bound for the length of a and hence for 
the difference between £,Nix) and £,N'ix)- 

□ 

Finally, we estimate the distance between the extended Gauss maps of 
nearby submanifolds. 

Proposition 3.17 Let {M,N) and {M,N') be gentle pairs such that 

d{N,N')< \. Ifxe TiN, then d(r7v(a;), T^v' (x)) < 250d(A, A'). 

4 

Proof. Let y = 'irN'{x), w = 7riv(x), and z = TTNiv) = T^NT^N'ix)- The 
polygon xyzwx is defined to consist of segments which are geodesies in M, 
except for zw, which is taken to be a geodesic in A. We lift the polygon 
xyzwx to GM as follows. On the segments xy and yz, we take the parallel 
section Ftv'; note that 

d{rN'{z),rN{z)) <d{N,N') <a. 

At this point, we jump to T]\f{z), which we parallel translate to w along 
the geodesic zw in A. Arriving at w, we have a space whose distance from 
Tn{w) is less than | (our bound for the second fundamental form of A^') 
times the length \zw\. Finally, we jump to Ti\r{w) and parallel translate it to 
X along the normal geodesic wx to arrive at Tfyf{x). The distance between 
the starting and ending values of our section is bounded above by the sum 
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1 



of the sizes of the jumps at z and w and 2^5 times the distance from the 
identity to the operator H of holonomy around xyzwx. 

Let a = d{N,N'). By Lemma ^.15 , d{z,w) < 14a; hence, \zw\ < 



7- 14a < 100a by Corollary and so the jump at w has size at most 150a. 
The distance d{I, TL) from the identity to TL is bounded by the product of the 
curvature (< 1) and the area of a surface spanning the polygon. We build 
such a surfac^ with the minimizing geodesies from x to the points on the 
segments yz and zw. All these points are within a distance d{x, N) + 2a < 
I + 2a < I of X, and Jacobi fields along these geodesic segments with the 
value zero at x are always increasing in length (no focal points), so the area 
of the surface is at most 

\{d{x, N) + 2a){\yz\ + \zw\) <{\ + l)(a + 100a) < 130a. 

Adding the estimates for the jumps and the holonomy, we conclude that 

d{TN{x),TN'{x)) < a + 150a + 2"^ . I30a < 250a. 

□ 



Remark 3.18 We can derive from the results of this section a crude 
"quasisymmetry" estimate for the distance: if d{N,N') < ^, then 
d{N',N) < 250d{N,N'). 

4 The main theorem 



In this section, we will prove Theorem 2.3, of which Theorem 2.2 is a direct 



consequence. For convenience, we repeat the statement of the former: 



Theorem 2^ Let M be a riemannian manifold and {Ng} a family of 
compact submanifolds of M parametrized in a measurable way by elements 
g of a probability space G, such that all the pairs {M,Ng) are gentle. If 
d{Ng,Nh) < e < 2o5oo 3 '^^^ ^ there is a well defined center 

of mass submanifold N with d{Ng,N) < 136-^6 for all g in G. The center 
of mass construction is equivariant with respect to isometrics of M and 
measure preserving automorphisms of G. 

^This is one of the pla ces where we use Condition (iii) in the definition of bounded 



geometry; see Remark 2.1 
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Proof. We begin with an outline of our proof and leave details to a series 
of lemmas which follow. 

For each g in the parameter space G, the potential function Pg = P/Vg 
is defined on the tubular neighborhood TNg. On the intersection T of 
these neighborhoods, the average over g of the functions Pg is a function 
P whose gradient is the average of the VPg- The Gauss maps Tg = Tj^^ 
are all defined on T as well, and since the manifolds Ng are close to 
one another, the average of the Tg (by the construction in the Appendix) 
is a well-defined section T of the Grassmann bundle. The zero set N of the 
projection PpVP of VP into the averaged vertical bundle r(T) is obviously 
an invariant subset of M. 

To see that is a smooth manifold which is close to all the Ng, we 
choose a base point e in G and look at the restriction of PpVP to each of 
the normal slices for N(,. To study these restrictions, we make PpVP into 
a vector field along the normal slices by applying the projection Pg^ into 
the quasi- vertical bundle (tangents to the normal slices) for A^e- Since the 



vertical and quasi- vertical bundles are close, by Proposition |3/7|, Pq^ gives 
an isomorphism between the two bundles, and A^ is again the zero set of 
the vector field ^ = Pg^PpVP along the normal slices. (We note that ^ is 
neither invariant nor a gradient, but this is not a problem.) 

Combining several estimates from Section ^, we will show that all the 
zeros of Pg^PpVP are nonsingular with index -|-1 along the normal slices, 
and that Pg^PpVP points outward along the sphere of radius lOOe in each 
slice. As a result, there is a unique zero inside that sphere, and the collection 
of all these zeros forms a manifold which is transverse to the normal slices 
and, in fact, is close to A^e- Since e was arbitrary, we get C"^ closeness of 
A^ to all the A^, and we are done. 

Remark 4.1 The arguments in the preceding paragraph require M to be 
finite dimensional. To extend the result to the infinite-dimensional case will 
require further analytic estimates on the projected gradient, so that some 
infinite-dimensional degree theory can be applied. 

We will divide the detailed proof into a series of steps, using the assump- 
tion that the {M,Ng) are gentle pairs with d{Ng,Nfi) < e for all g and h. 
We will add restrictions on e as necessary. Lemmas will be embedded in the 
proof; the end of the proof of a lemma will be indicated by the symbol A. 
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STEP 1. To begin, we establish a region on which all the potential functions 
are defined. 



Lemma 4.2 For any r such that e < r < 1, = Dg^c'^'^ contains 
T'^~^Nh for every h in G. 

Proof. If X G T^^'^Nfi, then there is a y G A'^^ with d{x, y) < r — e. For any 
g, since d{Ng, N^) < e, there is a z in Ng with d{z, y) < e. Then d{x, z) < r, 
and so x G T^'Ng. Thus T^'~''Nh is contained in each T'^Ng. 

A 

We will leave r undetermined for the moment and study geometry in the 
G- invariant subset T^. 

STEP 2. The Gauss maps Tg = TNg are all defined on T^. We wiU 
show that, when r and e are sufficiently small, they are close enough to one 
another so that their values can be averaged by the method of the Appendix 
to produce an equivariant Gauss map T. This averaged Gauss map is close 
to both the vertical and quasi- vertical bundles of the Ng. 

Lemma 4.3 If r < ^ and e < then the average T{x) of the subspaces 

^gix) <Z TxM can he constructed by Proposition A. 8 for each x G , with 



the Finsler distance d{T {x) ,T g{x)) < lOOOe for all g in G. The mapping 
r : TM is smooth and G- equivariant, and the construction ofV from 

the family {Ng} is equivariant with respect to isometrics of M and measure 
preserving automorphisms of G. 

Proof. Since x has distance less than | from each Ng, and d{Ng,Nh) < e. 
Proposition ^.17 gives the estimate d{Tg{x),rh{x)) < 250e < j. Proposition 



then produces in each tangent space TxM an averaged subspace T{x) 
with d{T{x),rg{x)) < sin~-^(3 • 250e) for all g. Since sin~^2;/x < | when 
< X < |, we conclude that d{T{x),Tg{x)) < lOOOe. 

The equivariance properties of F and its construction follow from the ab- 
sence of arbitrary choices in the construction. The smoothness of F follows 



from the corresponding assertion in Proposition A.8| (using local trivializa- 
tions). 

A 
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Lemma 4.4 For each x in T*" and g in G, the distance from T{x) to the 
quasi-vertical space Qg = keiTxT^Ng is less than lOOOe + jr^. 



Proof. Add the estimates in Propositions 3.7 and A. 5. 



STEP 3. The averaged potential function 



P= Padg 



G 



is defined on T^. Since the metric is invariant, 



A 



VP = / VPg dg. 

IG 



Lemma 4.5 Ifr<jQ,e<-^, and x G T*", then 



50' 



45 



\\VPix)-VPg{x)\\<-e. 
Proof. By the usual averaging argument, it suffices to check that 



n 45 

\VPa-VPh\\ < — e 



for all g and h in g. Observing that — VPg(x) is just the initial tangent vector 
of the minimizing geodesic from x to Ng, we get the result immediately from 
Lemma 0.16 . 



A 



STEP 4. As indicated in our outline of the proof, we define N to be the set 
of zeros in T'' of the projected gradient vector field PpVP. It is obviously a 
G-invariant subset of M. To study this zero set, we fix a reference element 
e of G and replace PpVP by its projection Pg^PpVP into the quasi-vertical 
bundle Qe for N^- Since this projected field is tangent to the normal slices 
for A'^e, it will have at least one zero in each normal slice if we can show that 
it points outward along some sphere, i.e. if it has a positive inner product 
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with VPg- As long as the hypotheses of Lemma 4.3 are satisfied, Lemma 
4.4 imphes that the distance ||Pp — ¥qJ\ is less than sin(l + j^) < 1, so 
the projection from the averaged normal bundle into the normal slices is 
an isomorphism, and the zero set is not changed. To simply formulas, we 
will denote the "doubly projected gradient" vector field Pq^P|tVP(x) by the 
symbol V. 



Lemma 4.6 If e < 2^ 99e < d{x,Ne) < then the inner product 
{V,VPe{x)) is positive. 



Proof. Let p = d{x^Ng). Under the assumptions of the Lemma, it follows 



from Lemma [4.4| and Proposition A. 4 that 
and from Lemma that ||VP - VPe|| < < 
Then 



-II < sinfi^ + J-^ < i 



(V,VPe(x)) = (PQ,PrVP(x),VPe(x)) = (VP(x),PrVPe(x)) 

= (VPe(x) + (VP(X) - VPe(x)), [Pq, + (Pp - PqJ] VPe(x)) 
= (VPe(x),VPe(x)) + (VPe(x),(Pr -PQjVPe(x)) 
+ (VP(X)- VPe(x)),VPe(x)) 
+ (VP(x) - VPe(x)), (Pp - PQjVPe(x)) 



> -[p-\p+^p- P+^p-\- p\> y>Q. 



A 



Following Lemma 4.6, we require e < 2^ and set r = lOOe. Then the set 



N intersects each normal slice at least once in 



STEP 5. We will show that each zero of V is nondegenerate with degree 1 
when restricted to a normal slice. It will follow that is a smooth section 
of the retraction tt^^, with the distance from N^. to N less than lOOe. 
The estimate used to establish nondegeneracy will also give us the desired 
bound on the distance d{N,Ne). For our considerations in this section, 
we require e < 20000' _ 

At each point x of N, the projected gradient V, being zero at that 
point, has an intrinsic derivative DV which is a linear map from TxM 
to the quasi- vertical space Qe at x. We will show that W is surjective 
with positive determinant when restricted to Qe- From this, it will follow 
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from the implicit function theorem that is a smooth manifold which is the 
graph of a smooth section of the retraction vrjv^. By our construction in Step 
5, the distance from to N less than lOOe. In fact, since any element 
of G could have been chosen as e, and the definition of is independent of 
this choice, the distance from each Ng to less than lOOe. 

Since the tangent space to A/^ at x is the kernel of PV, we will be able 
to estimate the distance by comparing the sizes of the restrictions of W 
to Qe and its orthogonal complement. Although DV is defined only at the 
zeros of V, it agrees there with an object defined throughout T^, namely the 
covariant derivative DV, considered as the operator v i— > D^V. (Although 
the values of DV do not generally lie in Qe, they do lie in there at the zeros 
of V, as we shall see.) 

To simplify notation, we will denote the operation of averaging over G 
by A. Then 

DV = T>¥Qjj.VAPg 



iBFQj¥j.AVPg + PQ,(DPr)^VP3 + ¥qJj.BAVP, 



a 



We analyze the last expression above term by term. 

The first term, (DPQ^)PpV^Pg, is zero along the zero set A^ of the 
projected gradient W^VAPg = PpVP. 

For the second term, we have ||PQ^(DPp)^VPg|| < ||DP|t|| sup^ ||VPg||. 
By Proposition 3.8, this is bounded above by 11 times the distance from Ng, 



i.e. by 11 • lOOe = llOOe. 

In the last term, Pg^PpD^VPg = ^Pg^PpDVPg, the factor DVPg is 
the hessian -ffAfg, which we will denote by Hg. Using Proposition 3.11| , we 



will estimate the expression Pg^PpD^VP^ by looking at the inner prod- 
uct (Pg^PpD^iVPg, It;) = Hg{v,^jfw), where w is an arbitrary quasi-vertical 
vector. We analyze separately the cases where v is quasi-vertical and quasi- 



horizontal. Since Proposition |3.11| uses the splitting into vertical and hori- 



zontal spaces, we have to estimate several "correction terms." 

Lemma 4.7 For any quasi-vertical vector v at any point of N, 
{VV{v),v)>§-Jvr. 

Proof. For v quasi-vertical, 

Hg{v,F^w) = Hg{FQ^v,F^w) (3) 
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Hg{Fr,v,Fr,w)+Hgi(FQ^^-Fr,)v,Fr,w) 
+Hg{¥r,v,iF^-Fr^)w)+Hg 



r,)t;,(Pr-Pr,H. 



We estimate first the differences between projections. By Proposition 



3.7 and Proposition 3.17 



< 



I'r, II + llPre - Pr, || < ^ix, N^f + 250e 
< 



^ 1 I 100 \2 I 250 ^ J_ 
^ 5 V 20000/ 20000 ^ 50 



By Lemma 4.3, 



V II < lOOOe < 



1000 

20000 ~ 20 



h. From tliis we liave in 



particular that 

||Pr,t^||>(l-||Pr,-Pr. 



\v\\ > 1 



250 \ „ „ 49 „ , 

h > — \\v\ 

20000 J " " 50 " ' 



Applying Proposition 3.11 now gives the estimate //^(Pr f,Pr v) > 



•64(|g) ||w|| > f Ibll • Since, by 3.11 , \\Hg\\ < 1.32, the remaining three 
terms in the last expression in Equation |3| are bounded above in norm by 
1.32||f Ip times and respectively, so that Hg{v,FYv) > ^||v|p for 

each quasi- vertical vector v. 

We return now to Equation ^ to estimate the derivative of V in the quasi- 
vertical direction. From that equation, we know that, for quasi-vertical v, 
{'DV{v), v) = (D„V) is the sum of three terms, the first of which vanishes and 
second of which is bounded in absolute value by 1100e||f |p along A^. The 
third term is the average over G of quantities which we have just shown to 
be bounded below by ^||t'|p. Since e < 20S00' have {'DV{v),v) > ^IblP 
for all quasi- vertical v. 

A 



It follows from Lemma 4.7 that the derivative PV is invertible on the quasi- 
vertical space, so that all the zeros of V are nondegenerate along the normal 
slices. Moreover the positivity of {'DV{v),v) shows that the degree of each 
zero is 1, so there is exactly one zero along each such slice. Thus, is the 
image of a smooth section of the normal bundle to A^e; i-e. of the tubular 
neighborhood retraction tt^^. To estimate the derivative of this section, and 
hence the distance from A^e to N, we need to estimate VV when applied 
to quasi-horizontal vectors. 
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Lemma 4.8 For any quasi-horizontal vector v at any point of the zero set 
N ofV, \\VV{v)\\ < eO^e \\v\\. 

Proof. We will estimate the inner product {'DV{v), w) = for quasi-horizontal 
V and quasi-vertical w. Since we already have the bound of 1100e||t;|| \\w\\ 
for what corresponds to the first two terms of Equation H, what remains 



to be estimated is Hg{v,¥Yw). For this we use Corollary 3.13, making 
the usual slight adjustment between horizontal and quasi-horizontal vectors, 
analogous to Equation ^. 



We estimate this expression term by term, as in the proof of Lemma 4.7, 
using Corollary p.l3| (and the duality of Corollary A. 6): \Hg{v,¥Yw)\ < 
[3Vr+1.32(ir2 + 250e-hl000e-h(ir2 + 250e)1000e)] \\v\\ \\w\\. Using the as- 
sumptions r = lOOe and e < 20S00 ^^'^ some arithmetic we get Hg{v,FY^w) < 
(30^-M670e)||u|| \\w\\. 

As with the previous lemma, we return to Equation |^ which expresses 
VV{v){v) = D v{v) as a sum of three terms, the first of which vanishes and 
second of which is bounded in norm by 1100e||t;|| along A^. The third term 
is the average over G of operators which we have just shown to be bounded 
above in norm by (30-y/e -|- 1670e)||f ||. As a result, we find ||PV(w)|| < 
(30V^-F2770e)||v|| < 60Ve|b||. 

A 



Combining the results of Lemmas 4.7 and 4.8, we conclude that the 
operator from to Qe whose graph is the tangent space TxN has norm at 
most 60^^ < 135^. By Corollary d{Tj^,Q^) < tan-i(135Ve) < 
135y/e. Combining this estimate with Proposition |3.7| , we find that the 
Finsler distance from A^ to the parallel translate of the tangent space to 
Ng at the foot of the normal from x is less than 136y^. This Finsler distance 
is one of the two quantities entering in the definition of the distance from 
Ne to A^. The other is simply d{x,Ne), which by Step 4 is bounded above 
by r = lOOe < ^/e. Since e could have been any element of G, we conclude 



that d{Ng,N) < 136^/e for all g in G, and Theorem 2.3 is proven. 

□ 
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5 Other ways of averaging submanifolds 



Before (and while) obtaining the present proof of Theorem 2^, we tried 
several other ways to average submanifolds. None of them have succeeded 
yet, but there may be means of circumventing the difficulties which we 
found. In each subsection below, we will describe a proposed procedure for 
averaging a pair of submanifolds; we leave it to the reader to general each 
one to arbitrary weighted families. 



5.1 Pursuit 

Given submanifolds A'^i and A/'2, we may move each point on Ni half-way 
along the geodesic segment to the nearest point on the other submanifold. 
The resulting two submanifolds should be closer together than the original 
pair, and we may then iterate the process and hope that the pair converges 
to a single submanifold. Variants on this method would involve going only a 
small fraction of the way from each submanifold to the other at each step, or 
letting each submanifold evolve continuously by the (time dependent) vector 
field pointing toward the nearest points on the other one. 

We did not succeed in obtaining the a priori bound on derivatives which 
would insure convergence for any version of this procedure. It might be 
interesting to test the idea with numerical experiments. 



5.2 The space of submanifolds 

One could consider A'^i and as points in the space of submanifolds, and 
then apply Grove-Karcher averaging there. (For a pair of submanifolds, this 
would just mean going half-way along the geodesic from one manifold to the 
other.) The problem here is that there is no natural differentiable struc- 
ture on a Banach manifold of unparametrized submanifolds. In fact, if one 
uses the natural coordinate systems given by sections of normal bundles, the 
transition maps fail to be differentiable at points which do not have extra 
smoothness. This difficulty is related to the fact that multiplication and 
inversion are not differentiable maps for any Banach manifold of diffeomor- 
phisms. Perhaps there is still a tractable variational problem for paths in 
the space of submanifolds for which two nearby points are connected by a 



unique minimizer. (See Section 6A for a related remark.) 
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5.3 Distribution theory 

The average of A'^i and is well defined as a de Rham current on M, i.e. 
as a continuous linear functional on the differential forms of degree equal to 
the dimension of the Ni (at least if the submanifolds are orientable). This 
current is closed and is "approximately a manifold" in some sense. Perhaps 
there is a theorem which allows one to "condense" such a current to an 
honest manifold. We don't even know how to begin proving such a theorem. 

Alternatively, since the submanifolds inherit riemannian metrics from 
M, one could consider their average as a measure on and, once again, 
try to turn this measure into one supported on a submanifold. This idea 
seems even harder to implement than the one using currents, since one does 
not easily read the differentiable structure of the submanifolds from the 
measures. 

6 Final remarks 

6.1 Is the average gentle? 

Unfortunately, our method gives no information on the extrinsic curvature of 
N , so cannot establish that (M, A^) is a gentle pair. In particular, we have no 
way to estimate the opposite distances d{N,Ng). It would be interesting to 
know whether the averaging of gentle pairs (even gentle curves in euclidean 
3-space) could produce a highly "wrinkled" submanifold. Perhaps one of the 
alternative averaging methods suggested in Section ^ would be preferable in 
this respect. 

We note that, for a pair of hypersurfaces with equal weights, our averag- 
ing method just produces the hypersurface of points which are equidistant 
from the original two. In this special case, there is no "loss of regularity," 
because the average hypersurface can be found directly from the potential 
functions of the original two, rather than from their gradients. 

6.2 Possible extensions of the theorem 

Is there a more global version of Theorem |2.2| , which would subsume Cartan's 
fixed point theorem (see the Introduction). In addition to the assumption 
of nonpositive curvature, one would presumably need to impose conditions 
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on the submanifold N, for instance that its normal exponential map be a 
difFeomorphism . 

It should be possible to extend our theorem to the case where the am- 
bient manifold M is infinite-dimensional. This extension would be essential 
for applications to almost- actions of non-finite compact Lie groups. (See the 
Introduction.) If M is a Hilbert manifold, the only place where more work 
needs to be done is in Steps 4 and 5 of the proof of the averaging theorem, 



where degree theory is used. (See Remark O.) If M is a general Banach 



manifold, we would also need to extend our study of the Grassmann man- 
ifolds to this case. This would be complicated by the fact that orthogonal 
projections are no longer available. 

There should also be versions of our theorem in symplectic geometry, 
e.g. for isotropic, lagrangian, or coisotropic submanifolds. If there are not, 
the obstructions to solutions should be interesting. 



6.3 Morphing 

Jamie Sethian pointed out that a solution to our averaging problem for a 
pair A''! and N2 of submanifolds would, on repeated application, give a fam- 
ily Nt of submanifolds defined for all binary fractions t G [1, 2]. Presumably, 
this could be extended by continuity to all real t S [1, 2], thus giving a defor- 
mation, or "morphing", between the two submanifolds. (Alternatively, we 
could vary the weight on the parameter space {1,2} to move from one sub- 
manifold to the other.) Conversely, any "reversible" procedure for morphing 
one manifold into another would give a solution to the averaging problem. 



6.4 Mass transport 

If the two submanifolds Ni are parametrized by the same manifold A^, the 
midpoint problem becomes trivial, since we can simply take the midpoints 
of the geodesic segments joining correspondingly parametrized points. (Of 
course, some estimates would still be needed to show that the resulting set 
of midpoints is a manifold.) Conversely, a solution of the midpoint prob- 
lem gives a diffeomorphism between A'^i and N2, obtained by projecting the 
average manifold onto both of them. David Aldous pointed out to me that 
this problem of finding a "distinguished" diffeomorphism between two sub- 
manifolds may be thought of as a special case of the Monge-Kantorovich 
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mass-transport problem (see, for example, jl^), in which an optimal cor- 
respondence is sought between the supports of two measures on the same 
space. Unfortunately, the correspondences found in most known solutions 
of this problem are multiple-valued. 



6.5 Evolution of submanifolds 

The representation of submanifolds as zeros of projected gradient functions 
may be useful for describing the evolution of manifolds under flows, gener- 



alizing the level set method for hypersurfaces. Ruuth et al [27| have studied 
such evolutions numerically. On the other hand, Ambrosio and Soner 
study these evolutions by representing a submanifold as the zero set of its 
distance or potential function. Although they use a single function and not 
a section of a bundle, they must analyze the derivatives of the potential 
function and the curvature of tube boundaries just as we do. 



A Appendix: Finsler distance and averaging in 
Grassmann manifolds 

We review in this section some geometry of Grassmann manifolds. Specif- 
ically, we will describe a center-of-mass construction derived from the av- 
eraging of projections, and we will analyze this construction in terms of a 
natural Finsler metric on the grassmannian. Many of the results here are 
probably not new, but there seems to be no convenient reference for them 
in the form which we need. 

We will work with finite dimensional subspaces of (possibly infinite di- 
mensional) Hilbert spaces. By duality, we will extend our results to sub- 
spaces of finite codimension (such as the normal spaces to finite dimensional 
submanifolds), but this is as far as we can go. The study of grassmannians 
of general subspaces is much more subtle. (See, for example. Chapter 7 of 

HO 

For our purposes, a Finsler manifold will be a Banach manifold with a 
continuous field of norms on the tangent spaces. The norms should define 
the topology but need not be smooth or strictly convex. These conditions 
are enough to define length of curves and an inner metric (see ||2^, §16) 
compatible with the topology, though nearby points may be connected by 
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many length-minimizing paths, such as in the case of the L°° ("taxicab") 
metric on the plane. 

A.l The Grassmannian and its metric 

For a real Hilbert space E and k = 0,1,2,... we denote by Gk{E) the 
Grassmann manifold of /c-dimensional subspaces of E, and by G{E) the 
manifold having the Gfc(£')'s as its connected components. Each Gk{E) is a 
homogeneous space under the natural action of the orthogonal group 0{E); 
the isotropy of an element F C E is the direct product 0{F) x 0{F^). Each 
tangent space TpG{E) is naturally isomorphic to the space Hom(F, i^-*-) of 
continuous linear maps from F to F^ . 

Endowing each tangent space Hom(F, F-^) with the operator norm gives 
an 0(ii^)-invariant Finsler metric on G{E). We will use this metric rather 
than the riemannian metric (also 0(£')-invariant) coming from the inner 
products {Qi,Q2) = Tr(QiQ2) tangent spaces. The Finsler distance 

d{F, F') between two subspaces is then defined as the infimum of the lengths 
of paths joining F to F'. 

We embed G{E) into the space S{E) of self-adjoint endomorphisms of E 
by mapping each F to the operator P{F) = of orthogonal projection onto 
F. The image of Gk{E) under this embedding consists of the self-adjoint 
projections with trace k. The tangent space to P{G{E)) at P{F) consists 
of those q in S{E) such that Fpq = q{I - Vp). The image of Q G TfG{E) 
under the derivative TpP has the block diagonal form 

Q* 
Q 

with respect to the decomposition E = F (B F^. 

The Hilbert norm y^Tr(g*g) of q is -v/2 times that of Q, but the operator 
norms of q and Q are exactly the same. Thus P is an isometric embedding 
of the Finsler manifold G{E) into the Banach space ^(S).^ 

We will also make use of the antisymmetric operator 



KQ) 



-Q* 
Q 



'The 



Proposition A.^ below. 



P keeps the length of curves fixed, but it generally decreases distances; see 
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in the Lie algebra o(-E') of 0{E). We call it the horizontal lift of the 
tangent vector Q, because the horizontal lifts of all Q £ Tf{G{E)) form 
a complement to the isotropy subalgebra o(-F) x o(-F-'") of F. By right 
translating the horizontal space around 0{E), we obtain a connection in 
the principal bundle 0{E) Gk{E). Thus, each path {Ft} in G{E) has a 
horizontal lift {gt} in 0{E) such that Ft = gt(Fo), and dFt/dt and dgt/dt 
have the same norm. 

It will be useful to compare the Finsler distance with some other notions 
of distance between subspaces. We have the following results. 

Proposition A.l If F and F' are subspaces of the same (finite) dimension 
in E, then the Finsler distance d{F,F') is equal to the distance do{F,F') 
between the unit spheres of F and F' , considered as submanifolds of the unit 
sphere in E (with the standard riemannian metric). 



Corollary A. 2 The Finsler distance between any two elements of Gk{E) is 
at most 7r/2. 



Corollary A. 3 The distance dQ{F,F') is symmetric in F and F' . 



Proposition A. 4 If F and F' are subspaces of the same (finite) dimension 
in E, then \\Ff -Fp'W = smd{F, F'). 



Proof of Proposition A.l. To begin, we note that, by an arbitrarily small 



perturbation of F, we can insure that 

FnF'^ = {0}. (*) 

It will suffice, then, to prove our equality for pairs having property (*). 
(When it is not satisfied, do{F,F') = tt/2.) 

We will show first that d{F,F') < do{F,F'). By our assumption (*), 
with respect to the decomposition E = F (B F^ , F' is the graph of a linear 
map (j) : F ^ F^. The non-negative symmetric operator on F has k 
eigenvalues which we take to be the squares of 

Ai > . . . > Afc > 0. 
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We denote an orthonormal basis of eigenvectors with these eigenvalues by 
ei,...,efc, and we call the angles 6j = tan~^ Aj the canonical angles^ 
between the subspaces F and F\ 
Since 

{(t>ej,(t>ek) = {(t>*(l)ej,ek) = Xpjk, 
we get an orthonormal basis for F' by taking the vectors 



Cj + (pCj Cj + (pCj 



l + A? yiTtan 



(cos Oj){ej + (pCj). 



We get an orthonormal set in F-^ by taking fj = {1/Xj)<pej whenever Xj ^ 0. 
For the remaining values of j, we set fj = 0. 

Our basis elements for F' can now be written as cos BjCj + sinOjfj, and 
we can now connect F and F' by the path 

Ft = cos{tej)ej + sm{tej)fj, 0<t<l. 

The velocity of this path at t = is the map Cj ^ djfj, which has norm 
di, the largest canonical angle. Since the Ft form a trajectory starting at 
-Fi for a 1-parameter group of isometries (generated by the skew-symmetric 
transformation which takes ej to Ojfj and is zero on {F © (pF)-^), the speed 
of the path is constant, so its length is 9i. Hence d{F,F') < 9i. But 
Si < dQ{F,F') because 9i is the distance from cos 6*161 +sin0i/i S F' to the 
unit sphere in F, so we have proven that d{F,F') < do{F,F'). 

For the reverse inequality, we will show that d(){F,F') is bounded above 
by the length of any path from F to F'. Let {Ft} be such a path, {gt} 
its horizontal lift to the orthogonal group. Any unit vector in F' is giv for 
some unit vector v in F; the distance from giv to the unit sphere in F is 
estimated by using the path {gtv}: 

d(v,giv) < I" |||(,..)|| dt < I' |||,,|| dt = £ W^t dtl 

which is the length of the path {Ft}. 

^ These angles have a long history, going back to Jordan We refer to for an 

extensive discussion of this history, and add as well the references |Q| and . Finally, we 
note that Porta and Recht ji^ have quite explicitly studied some aspects of the Finsler 
geometry of grassmannians. They represent subspaces by the involutions 2Pf — /, so their 
Finsler metric is twice ours. 
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□ 



Proof of Proposition A. 4. Let 6i > ... > 0^ be the canonical angles 
between F and F'. According to the preceding proposition, d{F,F') = 6i. 
In an orthonormal basis beginning with the orthonormal set ei, /i, 62, /2, • • • 
constructed in the proof above, the projection Fp/ is block diagonal with 
2x2 blocks of the form 



cos 



sm 



followed by zero blocks, while ¥p has the same form with the Oj^s replaced 
by 0. The difference Fp — Fp/ is built of diagonal blocks 



HQ) 



1 



COS 



COS sm ( 



sm 



smf 



sm a, 
- cos ( 



COS ( 

- sin t 



each of which has norm sin^j, so 

||P^ - p^,|| = sin 01 = sind(F,F') 



□ 



The identification of d{F, F') with the largest canonical angle also leads 
immediately to the following result. 

Corollary A. 5 // F' is the graph of a linear map u from F to F-^, then 



tan 



\u\ 



d{F,F') 

It is easy to go back and forth between subspaces and their orthogonal 
complements. 

Corollary A. 6 If F and F' are elements ofGk{E), and E is finite dimen- 
sional, then d{F,F') = d{F^,F'^). 

Proof. Note that the projections on a subspace and its orthogonal comple- 



ment sum to the identity operator. Now apply Proposition A. 4. 



□ 



Remark A. 7 Thanks to Corollary |A.6| (or simply using Proposition |A.4| as 
a definition of the distance), we can define the distance between subspaces 
of finite codimension in a Hilbert space to be the distance between their 
orthogonal complements. 

We also note that our distance functions are unaffected by rescaling of 
the inner product on E. 
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A. 2 Center of mass in the Grassmannian 



We turn now to the averaging of subspaces. Using Grove-Karcher averaging 
in the grassmannian makes it more difficult to obtain estimates of the kind 
we need, since it is hard to control the behavior of the Finsler distance along 
geodesies. Instead, we average the orthogonal projections associated to the 
subspaces. This approach was used already by de la Harpe and Karoubi in 
[^, but their work is more complicated and their estimates less sharp than 
ours because their projections are not necessarily orthogonal. 

Suppose, then, that we are given a family Fg of elements of Gk{E) 
parametrized measurably by elements g of a probability space G. If we 
average the projections ^Fg, we obtain a self-adjoint operator which is not, 
in general, a projection. However, if the Fg are sufficiently close together, 
this operator is an "almost projection," and we will see that it can be ap- 
proximated in a canonical way by a self-adjoint projection having rank k. 
Here is the precise result. 

Proposition A. 8 Let Fg be a family of elements of Gk{E) parametrized 
measurably by elements g of a probability space G such that d{Fg,Fh) < e 
for all g and h in G. As long as e < |, we can define an "average" element 
F G Gk{E) such that d{Fg,F) < sin""*^ 2e for all g in G. This averag- 
ing construction is equivariant with respect to isometrics of E and measure 
preserving automorphisms of G. If the Eg also depend smoothly on other 
parameters fx, the average F also depends smoothly on these parameters. 
Furthermore, if e < j, the magnitude of the derivative ^ is at most 8 
times the supremum over g of the magnitudes of the derivatives -qjj- . 



Proof. We note first of all that, by Proposition |A.4 W^Fg — ^F^W < For 
the average 

P= [ Ff, dg 



JG 

in the space of self-adjoint operators on E^ a simple computation shows that 
||P-PfJ| < e for ah g in G. 

We will now modify the approximate projection P to make it into an 
orthogonal projection. The estimate ||P— Pi?^ || < e tells us that the spectrum 
of P is contained in intervals of radius e about and 1. Since e < ^, we 
can apply to P the holomorphic function /i, defined on the complex A plane 
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with the hne Re A = ^ removed, having the value to the left of the line 
and 1 to the right. The operator h{P) is a spectral projection of P which 
can be written as an integral around a circle 7 of radius ^ — <5 around A = 1, 
where 5 can be an arbitrarily small positive number. 



The application of /i to P amounts to moving the spectrum to and 1 while 
keeping the invariant subspaces the same, so that \\h{P) — P\\ < e. 

Denoting by F the image of h{P), we have h{P) = P-^, and ||P-pT— P^^|| < 
\\h(P)-P\\ + \\P-FFj\ <e+e = 2e< 1 for all 5 in G. Since HP^^-PfJI < 1, 
the spaces F and Fg have the same dimension (see [20|, Theorem 6.32), so 
F G Gk{E). By Proposition d(F, Fg) < sin^^ 2e. 



The equivariance of our construction is obvious, since all the steps were 
canonical. To establish smooth dependence and estimate derivatives with 
respect to parameters, we continue our analysis of the operator function h by 
observing first that differentiation under the integral sign gives smoothness 
(in fact analyticity) of h. To estimate the derivative, we assume that the 
spectrum of an operator family P{n) lies in the union of discs of radius e 
around and 1 and compute: 



Hence, 



||^^||<i-(leng.h,)™p||(AI-P(„))-p ll^ll. 

As the radius of 7 approaches |, the length of 7 approaches vr, and so 

. dhjPip,)) ^ 1^ 1 u dPjis) ^ 2 u dPif^) u 
" dfi " - 27r^(i -e)2" dfi "-(l-2e)2" d^i 

As long as e < |, we have 
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and we shall be content with this crude estimate (though the factor 8 can 
probably be brought close to unity as e becomes small). 

We can now finish the proof of our Proposition. Let the spaces Fg depend 
smoothly on ^. Since the map F i— > P^t^ is a Finsler isometry, ||— ^^|| = 

Averaging does not increase norms, so H^^It^H < sup„||^^^||. By 



+ • + II 9? II II^'^'f II ^ o ll^'P's I 

our previous estimate, ||^|| = < 8 sup^ II "a^ I 



□ 



References 

[1] Afriat, S.N., Orthogonal and oblique projectors and the characteristics 
of pairs of vector spaces, Proc. Camb. Phil. Soc. 53 (1957), 800-816. 

[2] Ambrosio, L., and Soner, H.M., Level set approach to mean curvature 
flow in arbitrary codimension, J. Diff. Geom. 43 (1996), 693-737. 

[3] Anderson, R. M., "Almost" implies "near". Trans. Amer. Math. Soc. 
296 (1986), 229-237. 

[4] Bishop, R.L., and Crittenden, R.J., Geometry of Manifolds, Academic 
Press, New York, 1964. 

[5] Buser, P., and Karcher, H., Gromov's almost flat manifolds, Asterisque 
81, Societe Mathematique de France, Paris, 1981. 

[6] Cannas da Silva, A., and Weinstein, A., Lectures on Geomet- 
ric Models for Noncommutative Algebras, Berkeley Math. Lecture 
Notes, Amer. Math. Soc, Providence, 1999; electronic version at 
: //math . berkeley . edu/~alanw . 

[7] Cartan, E., Groupes simples clos et ouverts et geometric riemannienne, 
J. Math. Pures Appl. 8 (1929), 1-33. 

[8] Cartan, E., Legons sur la Geometric des Espaces de Riemann, 
Gauthier-Villars, Paris, 1928. 

[9] de la Harpe, P., and Karoubi, M., Representations approchees d'un 
groupe dans une algebre de Banach, Manuscripta Math. 22 (1977), 
293-310. 



38 



[10] Eschenburg, J., Comparison theorems and hypersurfaces, Manuscripta 
Math. 59 (1987), 295-323. 

[11] Eschenburg, J.-H., and Heintze, E., Comparison theory for Riccati 
Equations, Manuscripta Math. 68 (1990), 209-214. 

[12] Gangbo, W., and McCann, R.J., The geometry of optimal transporta- 
tion. Acta Math. 177 (1996), 113-161. 

[13] Gray, A., Tubes, Addison-Wesley, Redwood City, 1990. 

[14] Grove, K., and Karchcr, H., How to conjugate C^-close group actions. 
Math. Z. 132 (1973), 11-20. 

[15] Grove, K., Karcher, H., and Ruh, E.A. Group actions and curvature. 
Invent. Math. 23 (1974), 31-48. 

[16] Helgason, S., Differential geometry and symmetric spaces, Academic 
Press, New York, 1962. 

[17] Hyers, D. H., On the stability of the linear functional equation, Proc. 
Nat. Acad. Sci. U.S.A. 27 (1941), 222-224. 

[18] Jost, J., and Karcher, H., Geometrische Methoden zur Gewinnung von 
a-priori-Schranken fiir harmonische Abbildungen, Manuscripta Math. 
40 (1982), 27-77. 

[19] Jordan, C, Essai sur la geometric a n dimensions. Bull. Soc. Math. 
Prance 3 (1875), 103-174. 

[20] Kato, T., Perturbation Theory for Linear Operators. 2d ed., Springer- 
Verlag, Berlin-New York, 1976. 

[21] Lang, S., Fundamentals of Differential Geometry, Springer, New York 
(1999). 

[22] Paige, C.C., and Wei, M., History and generality of the CS decomposi- 
tion, Lin. Alg. Appl. 208/209 (1994), 303-326. 

[23] Petkanschin, B., Intcgralgeometrie 6. Zusammenhange zwishcn den 
Dichtcn dcr lineare Unterraume in n-dimensionalen Raum, Abh. Math. 
Sem. Hamburg. Univ. 11 (1936), 249-310. 



39 



[24] Porta, H., and Recht, L., Minimality of geodesies in Grassmann mani- 
folds, Proc. Amer. Math. Soc. 100 (1987), 464-466. 

[25] Pressley, A., and Segal, G., Loop Groups, Oxford University Press, New 
York, 1986. 

[26] Rinow, W., Die Innere Geometrie der Metrischen Rdume, Springer, 
Berlin, 1961. 

[27] Ruuth, S.J., Merriman, B., Xin, J., and Osher, S., Diffusion-generated 
motion by mean curvature for filaments, preprint, UCLA, 1998. 

[28] Ulam, S., Sets, Numbers, and Universes, MIT Press, Cambridge, 1974. 

[29] Warner, F.W., Extensions of the Rauch comparison theorem to sub- 
manifolds, Trans. Amer. Math. Soc. 122 (1966), 341-356. 

[30] Weinstein, A., From Riemann geometry to Poisson geometry and 
back again. Lecture at Chern Symposium, MSRI, 1998, available at 
http : //msri . org/publications/video/ contents . htiiil| . 

[31] Whitney, H., Differentiable manifolds, Annals of Math. 37 (1936), 645- 
680. 



40 



